ARF @ MediaEval 2012: Multimodal Video Classification

نویسندگان

  • Bogdan Ionescu
  • Ionut Mironica
  • Klaus Seyerlehner
  • Peter Knees
  • Jan Schlüter
  • Markus Schedl
  • Horia Cucu
  • Andi Buzo
  • Patrick Lambert
چکیده

In this paper we study the integration of various audio, visual and text-based descriptors for automatic video genre classification. Experimental validation is conducted on 26 video genres specific to web media platforms (e.g. blip.tv).

منابع مشابه

A Multimodal Approach for Video Geocoding

Developed in the context of placing task at MediaEval 2012, this work addresses the problem of automatically assigning geographical coordinates to videos. This year our group extended the implementation of our framework for multimodal geocoding for combining textual and visual descriptors. In this paper, we describe our approach and report the results for 2012 datasets.

متن کامل

MusiClef 2013: Soundtrack Selection for Commercials

MusiClef was one of the“brave new tasks”at MediaEval 2013 with a multimodal approach that combined music, video and textual information in order to evaluate systems that recommend a music soundtrack given the video of a commercial and the information on the product to be advertised.

متن کامل

Tokyo Tech at MediaEval 2016 Multimodal Person Discovery in Broadcast TV task

This paper describes our diarization system for the Multimodal Person Discovery in Broadcast TV task of the MediaEval 2016 Benchmark evaluation campaign [1]. The goal of this task is naming speakers, who are appearing and speaking simultaneously in the video, without prior knowledge. Our diarization system relies on face diarization approach. We extract deep features from a face every 0.5 secon...

متن کامل

RFA at MediaEval 2015 Affective Impact of Movies Task: A Multimodal Approach

The MediaEval 2015 Affective Impact of Movies Task challenged participants to automatically find violent scenes in a set of videos and, also, to predict the affective impact that video content will have on viewers. We propose the use of several multimodal descriptors, such as visual, motion and auditory features, then we fuse their predictions to detect the violent or affective content. Our bes...

متن کامل

ARF @ MediaEval 2012: A Romanian ASR-based Approach to Spoken Term Detection

In this paper, we attempt to resolve the Spoken Term Detection problem for under-resourced languages within the Automatic Speech Recognition (ASR) paradigm. The proposed methods are validated with unseen dataset in African languages.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012